Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-Based Reranking
نویسندگان
چکیده
The goal of local citation recommendation is to recommend a missing reference from the context and optionally also global context. To balance tradeoff between speed accuracy in large-scale paper database, viable approach first prefetch limited number relevant documents using efficient ranking methods then perform fine-grained reranking more sophisticated models. In that vein, BM25 has been found be tough-to-beat prefetching, which why recent work focused mainly on step. Even so, we explore prefetching with nearest neighbor search among text embeddings constructed by hierarchical attention network. When coupled SciBERT reranker fine-tuned tasks, our Attention encoder (HAtten) achieves high recall for given candidates reranked. Consequently, requires fewer rerank, yet still state-of-the-art performance various datasets such as ACL-200, FullTextPeerRead, RefSeer, arXiv.
منابع مشابه
Incremental Reranking for Hierarchical Text Classification
The top-down method is efficient and commonly used in hierarchical text classification. Its main drawback is the error propagation from the higher to the lower nodes. To address this issue we propose an efficient incremental reranking model of the top-down classifier decisions. We build a multiclassifier for each hierarchy node, constituted by the latter and its children. Then we generate sever...
متن کاملContent-Based Citation Recommendation
We present a content-based method for recommending citations in an academic paper draft. We embed a given query document into a vector space, then use its nearest neighbors as candidates, and rerank the candidates using a discriminative model trained to distinguish between observed and unobserved citations. Unlike previous work, our method does not require metadata such as author names which ca...
متن کاملA Hierarchical Contextual Attention-based GRU Network for Sequential Recommendation
Sequential recommendation is one of fundamental tasks for Web applications. Previous methods are mostly based on Markov chains with a strong Markov assumption. Recently, recurrent neural networks (RNNs) are getting more and more popular and has demonstrated its effectiveness in many tasks. The last hidden state is usually applied as the sequence’s representation to make recommendation. Benefit ...
متن کاملLarge-scale Structural Reranking for Hierarchical Text Categorization
Current hierarchical text categorization (HTC) methods mainly fall into three directions: (1) Flat one-vs.-all approach, which flattens the hierarchy into independent nodes and trains a binary one-vs.-all classifier for each node. (2) Top-down method, which uses the hierarchical structure to decompose the entire problem into a set of smaller subproblems, and deals with such sub-problems in top-...
متن کاملCitation Recommendation via Proximity Full-Text Citation Analysis and Supervised Topical Prior
Currently the many publications are now available electronically and online, which has had a significant effect, while brought several challenges. With the objective to enhance citation recommendation based on innovative text and graph mining algorithms along with full-text citation analysis, we utilized proximitybased citation contexts extracted from a large number of full-text publications, a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-99736-6_19